RAW SEQUENCE LISTING DATE: 09/03/2004 

PATENT APPLICATION: US/09/937 , 899A TIME: 10:33:43 



Input Set : A:\lllSEQ.txt 

Output Set: N:\CRF4\09032004\l937899A.raw 

3 <110> APPLICANT: Koulu, Markku 

4 Karvonen, Matti 

5 Pesonen, Ullamari 

6 Uusitupa, Matti 

8 <120> TITLE OF INVENTION: Diagnosis of a Person's Risk of Developing Alcohol 

9 Based on Leucine7 to Proline 7 Polymorphism in the 

10 Prepro -Neuropeptide Y Gene, and methods for the 

11 prevention and Treatment of Alcoholism. 
13 <130> FILE REFERENCE: 2630-111 

15 <140> CURRENT APPLICATION NUMBER: 09/937, 899A 

16 <141> CURRENT FILING DATE: 2001-09-28 

18 <150> PRIOR APPLICATION NUMBER: PCT/FI00/00260 

19 <151> PRIOR FILING DATE: 2000-03-29 

21 <150> PRIOR APPLICATION NUMBER: US 09/291,994 

22 <151> PRIOR FILING DATE: 1999-04-15 
24 <160> NUMBER OF SEQ ID NOS : 9 

2 6 <170> SOFTWARE: Patent In Ver . 2.0 

28 <210> SEQ ID NO: 1 

29 <211> LENGTH: 325 

30 <212> TYPE: DNA 

31 <213> ORGANISM: Homo sapiens 

33 <400> SEQUENCE: 1 

34 ccgcttcttc aggcagtgcc tggggcggga gggttggggt gtgggtggct ccctaagtcg 60 

3 6 acactcgtgc ggctgcggtt ccagccccct ccccccgcca ctcaggggcg ggaagtggcg 12 0 
38 ggtgggagtc acccaagcgt gactgcccga ggcccctcct gccgcggcga ggaagctcca 180 
40 taaaagccct gtcgcgaccc gctctctgca ccccatccgc tggctctcac ccctcggaga 240 
42 cgctcgcccg acagcatagt acttgccgcc cagccacgcc cgcgcgccag ccaccgtgag 300 
44 tgctacgacc cgtctgtcta ggggt 325 

47 <210> SEQ ID NO: 2 

48 <211> LENGTH: 247 

49 <212> TYPE: DNA 

50 <213> ORGANISM: Homo sapiens 

52 <40 0> SEQUENCE: 2 

53 cccgtccgtt gagccttctg tgcctgcaga tgctaggtaa caagcgactg gggctgtccg 60 
55 gactgaccct cgccctgtcc ctgctcgtgt gcctgggtgc gctggccgag gcgtacccct 120 
57 ccaagccgga caacccgggc gaggacgcac cagcggagga catggccaga tactactcag 180 
59 cgctgcgaca ctacatcaac ctcatcacca ggcagaggtg ggtgggaccg cgggaccgat 240 
61 tccggga 247 

64 <210> SEQ ID NO: 3 

65 <211> LENGTH: 142 

66 <212> TYPE: DNA 

67 <213> ORGANISM: Homo sapiens 
6 9 <400> SEQUENCE: 3 




ENTERED 
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RAW SEQUENCE LISTING DATE : 09/03/2004 

PATENT APPLICATION: US/09/937 , 899A TIME: 10:33:44 

Input Set : A:\lllSEQ.txt 

Output Set: N:\CRF4\09032004\l937899A.raw 

70 acttgcttta aaagactttt ttttttccag atatggaaaa cgatctagcc cagagacact 60 
72 gatttcagac ctcttgatga gagaaagcac agaaaatgtt cccagaactc ggtatgacaa 12 0 
74 ggcttgtgat ggggacattg tt 142 

77 <210> SEQ ID NO: 4 

78 <211> LENGTH: 300 
7 9 <212> TYPE : DNA 

80 <213> ORGANISM: Homo sapiens 

82 <400> SEQUENCE: 4 

83 ccttacatgc tttgcttctt atgttttaca ggcttgaaga ccctgcaatg tggtgatggg 60 
85 aaatgagact tgctctctgg ccttttccta ttttcagccc atatttcatc gtgtaaaacg 120 
87 agaatccacc catcctacca atgcatgcag ccactgtgct gaattctgca atgttttcct 180 
89 ttgtcatcat tgtatatatg tgtgtttaaa taaagtatca tgcattcaaa agtgtatcct 240 
91 cctcaatgaa aaatctatta caatagtgag gattattttc gttaaactta ttattaacaa 300 

94 <210> SEQ ID NO: 5 

95 <211> LENGTH: 551 

96 <212> TYPE: DNA 

97 <213> ORGANISM: Homo sapiens 

99 <220> FEATURE: 

100 <221> NAME /KEY : CDS 

101 <222> LOCATION: (87) . . (377) 

103 <220> FEATURE: 

104 <221> NAME/KEY: sig_peptide 

105 <222> LOCATION: (87) . . (170) 

107 <400> SEQUENCE: 5 

108 accccatccg ctggctctca cccctcggag acgctcgccc gacagcatag tacttgccgc 60 
110 ccagccacgc ccgcgcgcca gccacc atg eta ggt aac aag cga ctg ggg ctg 113 



111 
















Met Leu Gly Asn Lys Arg Leu Gly Leu 




112 


















1 






5 










114 


tec 


gga 


ctg 


acc 


etc 


gec 


ctg 


tec 


ctg 


etc 


gtg 


tgc ctg 


ggt 


gcg 


ctg 


161 


115 


Ser 


Gly 


Leu 


Thr 


Leu 


Ala 


Leu 


Ser 


Leu 


Leu 


Val 


Cys Leu 


Gly 


Ala 


Leu 




116 


10 










15 










20 






25 




118 


gec 


gag 


gcg 


tac 


ccc 


tec 


aag 


ccg 


gac 


aac 


ccg 


ggc gag 


gac 


gca 


cca 


209 


119 


Ala 


Glu 


Ala 


Tyr 


Pro 


Ser 


Lys 


Pro 


Asp 


Asn 


Pro 


Gly Glu 


Asp 


Ala 


Pro 




120 










30 










35 






40 






122 


gcg 


gag 


gac 


atg 


gee 


aga 


tac 


tac 


teg 


gcg 


ctg 


cga cac 


tac 


ate 


aac 


257 


123 


Ala 


Glu 


Asp 


Met 


Ala 


Arg 


Tyr 


Tyr 


Ser 


Ala 


Leu 


Arg His 


Tyr 


He 


Asn 




124 








45 










50 








55 








126 


etc 


ate 


ace 


agg 


cag 


aga 


tat 


gga 


aaa 


cga 


tec 


age cca 


gag 


aca 


ctg 


305 


127 


Leu 


He 


Thr 


Arg 


Gin 


Arg 


Tyr 


Gly 


Lys 


Arg 


Ser 


Ser Pro 


Glu 


Thr 


Leu 




128 






60 










65 








70 










130 


att 


tea 


gac 


etc 


ttg 


atg 


aga 


gaa 


age 


aca 


gaa 


aat gtt 


ccc 


aga 


act 


353 


131 


He 


Ser 
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Leu 


Leu 


Met 


Arg 


Glu 


Ser 


Thr 


Glu 


Asn Val 


Pro 


Arg 


Thr 




132 




75 










80 










85 








134 


egg 


ctt 


gaa 


gac 


cct 


gca 


atg 


tgg 


tgatgggaaa tgagacttgc tctctggcct 


407 


135 


Arg 


Leu 


Glu 


Asp 


Pro 


Ala 


Met 


Trp 


















136 


90 










95 























138 tttcctattt tcagcccata tttcatcgtg taaaacgaga atccacccat cctaccaatg 467 
140 catgcagcca ctgtgctgaa ttctgcaatg ttttcctttg tcatcattgt atatatgtgt 527 
142 gtttaaataa agtatcatgc attc 551 
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RAW SEQUENCE LISTING DATE: 09/03/2004 

PATENT APPLICATION: US/09/937 , 89 9A TIME: 10:33:44 



Input Set : A:\lllSEQ.txt 

Output Set: N:\CRF4\09032004\l937899A.raw 

145 <210> SEQ ID NO: 6 

146 <211> LENGTH: 97 

147 <212> TYPE: PRT 

148 <213> ORGANISM: Homo sapiens 
150 <400> SEQUENCE: 6 



151 


Met 


Leu 


Gly Asn 


Lys 


Arg 


Leu 


Gly 


Leu 


Ser 


Gly Leu Thr 


Leu 


Ala 


Leu 


152 


1 








5 










10 








15 




154 


Ser 


Leu 


Leu 


Val 


Cys 


Leu 


Gly Ala 


Leu 


Ala 


Glu 


Ala Tyr 


Pro 


Ser 


Lys 


155 








20 










25 








30 




157 


Pro 


Asp 


Asn 


Pro 


Gly 


Glu 


Asp 


Ala 


Pro 


Ala 


Glu 


Asp Met 


Ala 


Arg 


Tyr 


158 






35 










40 








45 




160 


Tyr 


Ser 


Ala 


Leu Arg 


His 


Tyr 


He 


Asn 


Leu 


He 


Thr Arg 


Gin 


Arg 


Tyr 


161 




50 










55 










60 




163 


Gly Lys 


Arg 


Ser 


Ser 


Pro 


Glu 


Thr 


Leu 


He 


Ser 


Asp Leu 


Leu 


Met 


Arg 


164 


65 










70 










75 








80 


166 


Glu 


Ser 


Thr 


Glu 


Asn 


Val 


Pro 


Arg 


Thr 


Arg 


Leu 


Glu Asp 


Pro 


Ala 


Met 


167 










85 










90 






95 





169 Trp 

172 <210> SEQ ID NO: 7 

173 <211> LENGTH: 12 

174 <212> TYPE: RNA 

175 <213> ORGANISM: Homo sapiens 

177 <220> FEATURE: 

178 <221> NAME/KEY: misc_feature 

179 <222> LOCATION: (10) 

180 <223> OTHER INFORMATION: n is a u in the wildtype and c in the mutant 
182 <400> SEQUENCE: 7 

W--> 183 acaagcgacn gg 12 

186 <210> SEQ ID NO: 8 

187 <211> LENGTH: 9 

188 <212> TYPE: DNA 

189 <213> ORGANISM: Homo sapiens 

191 <220> FEATURE: 

192 <221> NAME /KEY: misc_f eature 

193 <222> LOCATION: (5) 

194 <223> OTHER INFORMATION: n is a t in the wildtype and c in the mutant 
196 <400> SEQUENCE: 8 

W--> 197 cgacngggg <j 

200 <210> SEQ ID NO: 9 

201 <211> LENGTH: 12 
2 02 <212> TYPE: DNA 

203 <213> ORGANISM: Homo sapiens 
205 <400> SEQUENCE: 9 

2 06 acaagcgacc gg 12 
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RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/09/937 , 899A 



DATE: 09/03/2004 
TIME: 10:33:45 



Input Set : A:\lllSEQ.txt 

Output Set: N:\CRF4\09032004\l937899A.raw 



Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:7; N Pos . 10 
Seq# : 8 ; N Pos . 5 
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VERIFICATION SUMMARY DATE: 09/03/2004 

PATENT APPLICATION: US/09/937 , 899A TIME: 10:33:45 

Input Set : A:\111SEQ. txt 

Output Set: N:\CRF4\09032004\l937899A.raw 

L:183 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7 after pos . : 0 
L:197 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8 after pos . : 0 
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